A Methodological Approach to Data Quality Management Supported by Data Mining
نویسندگان
چکیده
In this paper, we use the example of the car manufacturing domain to illustrate how data quality problems are addressed in practice today. We then propose a process model for data quality management (DQM) which meets the requirements of the current ISO 9001 standard and thus introduces a methodological, processoriented approach to DQM. Data mining methods that are typically applied to find interesting and previously unknown patterns in large amounts of data are being used to support several phases of this process model. The main idea behind the application of data mining methods is to deem data anomalies deviations from a ‘normal’ quality state. The primary advantage of our approach is an increased degree of automation and enhanced thoroughness and flexibility of DQM. 1. Data Quality Challenges in Automotive Manufacturing While product quality has always been a central focus at DaimlerChrysler, data quality has not yet received the attention it deserves. This does not mean that data quality has been completely neglected thus far, but most data quality initiatives have had solely a strong local focus. With the growing demand for the integration of distributed, heterogeneous databases into corporate warehouse applications, data deficiencies have become obvious, necessitating corporate-wide DQM to address data quality issues across system boundaries. This global data quality view implicates additional data quality perspectives and presents challenges regarding the related tools and methodologies. As correcting data already stored in a database system is much more expensive than setting up appropriate measures to prevent substandard-quality data to be entered into the systems, precautions should be given top priority. However, as real environments are complex, there will be always a need for measuring, monitoring, and improving the quality of data after it has initially been stored. This was one of the motivations for initiating a research project which focuses on the application of data mining technologies in the context of DQM. We have introduced the term Data Quality Mining, which we believe to have great potentials for both future research work and a successful transfer of data mining technology into daily work processes. In section 2, we propose a quality management system for data integration that meets the requirements of the current ISO 9001 standard. Section 3 presents a case study where a subset of these concepts has been applied to the QUIS (QUality Information System) database of the Global Services and Parts division of the Group using data mining techniques. Finally, we give an overview of related work and further research issues in section 4. Proceedings of the Sixth International Conference on Information Quality
منابع مشابه
Modelling Customer Attraction Prediction in Customer Relation Management using Decision Tree: A Data Mining Approach
In Today’s quality- based competitive world, known as knowledge age, customer attraction is of ultimate importance. In respect to the slogan “customer is always right”, customer relation management is the core of an organizational strategy playing an important role in four aspects of customer identification, customer attraction, customer retaining, and customer satisfaction. Commercial organiza...
متن کاملCustomer Retention Based on the Number of Purchase: A Data Mining Approach
Purpose: this study wants to find any relationship between the numbers of purchase and the income the customer brings to the company. The attempt is to find those customers who buy more than one life insurance policy and represent the signs of good payments at the same time by the help of data mining tools. Design/ methodology/ approach: the approach of this research is to use data mining tools...
متن کاملA data mining approach to employee turnover prediction (case study: Arak automotive parts manufacturing)
Training and adaption of employees are time and money consuming. Employees’ turnover can be predicted by their organizational and personal historical data in order to reduce probable loss of organizations. Prediction methods are highly related to human resource management to obtain patterns by historical data. This article implements knowledge discovery steps on real data of a manufacturing pla...
متن کاملApplying a decision support system for accident analysis by using data mining approach: A case study on one of the Iranian manufactures
Uncertain and stochastic states have been always taken into consideration in the fields of risk management and accident, like other fields of industrial engineering, and have made decision making difficult and complicated for managers in corrective action selection and control measure approach. In this research, huge data sets of the accidents of a manufacturing and industrial unit have been st...
متن کاملIdentification of the Patient Requirements Using Lean Six Sigma and Data Mining
Lean health care is one of new managing approaches putting the patient at the core of each change. Lean construction is based on visualization for understanding and prioritizing imporvments. By using only visualization techniques, so much important information could be missed. In order to prioritize and select improvements, it’s essential to integrate new analysis tools to achieve a good unders...
متن کامل